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Abstract — In this paper the chaotic properties of the 
TCP congestion avoidance mechanism are investigated. The 
analysis focuses on the origin of the complex behavior ap- 
pearing in deterministic TCP/IP networks. From the traffic 
modeUng point of view the understanding of the mechanism 
generating chaos is essential, since present models are un- 
able to cope with this phenomena. 

Using the basic tools of chaos theory in our study, the 
main characteristics of chaotic dynamics are revealed. The 
dynamics of packet loss events is studied by a simple sym- 
boUc description. The cellular structure of the phase space 
of congestion windows is shown. This implies periodic be- 
havior for large time scales. 

Chaotic behavior in short time scales and periodicity for 
larger times makes it necessary to develop models that ac- 
count for both. Thus a simple model that describes the con- 
gestion window dynamics according to fluid equations, but 
handles the packet loss events separately is introduced. This 
model can reproduce the basic features observed in reaUstic 
packet level simulations. 



L Introduction 

Recently Veres and Boda have demonstrated that 
TCP congestion control can be chaotic in certain circum- 
stances. Chaos in practice means that TCPs influenc- 
ing each other in a computer network can produce highly 
complex behavior in time which is sensitive to small per- 
turbations, yet the equations describing it are determinis- 
tic and simple. Understanding the mechanism and equa- 
tions that produce chaos in a TCP/IP network is crucial 
in traffic modeling. If we can identify the details of the 
mechanism then it is possible to build new kinds of net- 
work traffic models where the factors important from the 
point of view of dynamics and chaos are kept while many 
unimportant factors are reduced or simplified. Current 
TCP models are either too simplistic by assuming periodic 
behavior or they are entirely stochastic disregarding small 
details of a given network which might alter the dynam- 
ics completely and even change its statistical properties. 



Stochastic models can also break the temporal and spa- 
tial (topological) structure of correlations existing among 
TCPs in an extended network and are not able to give a 
correct account of possible long range dependence born 
between far away TCPs [^. On the other hand, new chaos 
aware TCP and network models can preserve correlations 
and show the same level of complexity as it is observed in 
real network traffic. These features might turn out to be 
unavoidable when building reliable models of large net- 
works where packet level simulation requires astronom- 
ical computer resources and/or simulation time. More- 
over, chaos theory itself provides new tools to quantify 
this complexity. The complexity of a real network traf- 
fic or its packet level simulation and the complexity of 
a traffic model can be compared. Despite their inherent 
simplicity, these tools have never been used in network- 
ing and they might open new prospects in verifying TCP 
and network models or can even characterize the actual 
network state. 

We think that all these issues are so important, that we 
should answer at least the basic open questions concern- 
ing the chaotic state of the TCP congestion control mech- 
anism. For example we still do not know what causes 
TCP chaos exactly. Accordingly we do not know how 
generic its appearance is. For example, it has been argued 
in Ref. [||] that in the chaotic examples shown in Ref. [jl|] 
the packet loss probability is considerably higher than 1%. 
It has been shown [|3|-|5|] that in this case the exponential 
backoff mechanism plays an important role and can be re- 
sponsible for the complex congestion window dynamics. 
We are going to show that chaos is not a consequence of 
high network congestion or loss and that TCPs operating 
in congestion avoidance mode, never entering into a back- 
off state show chaos. In other words chaos is the generic 
behavior of many TCP systems and periodicity and syn- 
chronization is rather exceptional. 

Also we have to ask how we can distinguish TCP chaos 
from stochasticity and do we gain anything by doing that. 
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To point out the main weakness of stochastic models and 
to call for chaos aware models we would like to show 
that current stochastic TCP models are even unable to pre- 
dict the traffic in a simple scenario like the one shown in 
Fig. H and investigated throughout this paper. In this setup 



(~ 7 • 10"^). The ' 'winner" TCP behaves seemingly peri- 
odically, while the "losers" seem to be erratic. Obviously, 




Fig. 1 

Investigated network model: There are three TCP 
flows sharing a common buffer that can store b 
packets and a common line with delay do and speed co- 
Then the common link splits into three different lines 

WITH different DELAYS AND SPEEDS. IN THE ACTUAL 
simulations B = 100, To = 400 ms, Ti = 100 ms, 
T2 = 150 ms, T3 = 200 ms, Co = 10** bps AND 
Ci = C2 = C3 = 10^ bps HAS BEEN CHOSEN. 



three TCP flows sharing a common buffer that can store 
B packets and a common line with delay Tq and speed 
Co, then splitting into three different lines with different 
delays and speeds. In the actual simulations B = 100, 
To = 400 ms, Ti = 100 ms, T2 = 150 ms, Ts = 200 ms, 
To = 10*5 bps and Ci = C2 = C3 = 10^ bps has been 
chosen. The congestion windows of the competing TCPs 
are not limited by the senders or by the receivers. The 
injected TCP packets can be lost only at the bottleneck 
buffer, there are no random losses on the links or in other 
buffers. It follows that the traffic is controlled by strict de- 
terministic rules, that is by the TCP Reno algorithm 
Numerical simulations were carried out by Network Sim- 
ulator (ns) version 2b5 [^. 

Based on ideas brought from stochastic TCP modeling 
a common belief is that TCP is biased against long round 
trip time connections and the throughputs are proportional 
to ~ This assumption has been proven to be very 

good in the presence of random elements in the simulation 
[^. In reality, shown in Fig. 2(a)| it can be observed that 
the congestion window corresponding to the largest round 
trip time is significantly larger than the others. This TCP 
obtains unfairly higher throughput than the other two. The 
packet loss rate of the preferred TCP (~ 5 • 10~^) is also an 
order of magnitude less than that of the suppressed ones 
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Fig. 2 

(A) A TYPICAL PART OF THE CONGESTION WINDOW TIME SERIES 
IS SHOWN FOR THE SIMULATION OF THE SCENARIO OF FlG.[l|. (B) 
The EFFECT OF A SMALL PERTURBATION AT t = 120 S ON THE 
CONGESTION WINDOW DEVELOPMENT. 



there is something here which is missed completely by 
the stochastic model. Next, we show that this simple sce- 
nario is already chaotic and the deterministic nature of 
packet losses cannot be disregarded if we would like to 
build models that correctly predict the temporal behavior 
of congestion windows. 

II. The TCP butterfly effect 

The complete state of a TCP can be given by a number 
of internal variables at any moment [^. Such variables 
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are the congestion window, the slow start threshold, the 
retransmission timeout, the backoff counter, the duplicate 
ACK counter, and so on. However, during the optimal 
operation of TCP, in congestion avoidance mode, a single 
variable can be selected which controls almost completely 
the behavior of the TCP: that is the congestion window. 
This variable has also practical importance, since it limits 
the maximum number of unacknowledged packets sent by 
a TCP into the network. 

To demonstrate how sensitive this system can be for 
small perturbations the same simulation has been run for 
120 s and then a perturbation of 6wi{0) = 0.01 has been 
added to all the congestion window values at t = 120 s. 



The result is shown in Fig. |2(b)| . The congestion windows 
remain unchanged until t = 120 s. Then the difference 
between the congestion windows of the two simulations 
|(5w(t)| remains the same (~ 0.01) until the first packet 
drop event. Then one of the underprivileged TCPs, whose 
packet has been lost, halves its window. As a result, the 
distance between the original and the perturbed trajecto- 
ries grows about an order of magnitude, since the owner of 
the lost packet differs in the original and in the perturbed 
simulations. At this point it seems that the dominant TCP 
is not affected at all. Finally, around t = 240 s the time 
evolution of the dominant TCP diverges completely from 
the original trace due to a permutation of packets, result- 
ing in a loss event for the dominant TCP. As we can see the 
rest of the simulation differs from the original one. Such 
sensitivity against small perturbations is called the butter- 
fly effect in chaos theory and gives us the first clue that 
this system operating in congestion avoidance is actually 
chaotic. Next we introduce a few basic tools which help 
us to characterize the chaotic state. 

III. Characterizing chaos 

One of the most basic tools of chaos theory in vi- 
sualizing the dynamics is the Poincare surface of sec- 
tion. Instead of looking at the continuous time evolu- 
tion of trajectories one can select a surface in the phase 
space and watch only when the trajectories cross that sur- 
face. In case of TCP congestion window trajectories the 
evolution between two packet loss events is fairly sim- 
ple. All the interesting things happen at the times of 
packet losses. Therefore the values of congestion win- 
dows taken at the moments of packet losses of any of 
the TCPs is a natural choice for a surface of section in 
general. In Fig. ^ this surface of section is shown for 
our system. One can see that the congestion window 
triplets {wi,W2,W3) taken at times of packet losses ap- 
proximately form a two dimensional surface within the 
three dimensional congestion window phase space. It is 




Fig. 3 

Poincare section oe the phase space. Congestion 
window values egr the three tcps at packet loss times. 
Colours depend on which TCP lost a packet. The 

DEPICTED (PINK) SUREACE IS WHERE THE SUM OE CONGESTION 
WINDOWS IS APPROXIMATELY EQUAL WITH THE MAXIMUM 
NUMBER OF PACKETS WHICH CAN TRAVEL ON THE LINES OR STAY 
IN THE BUEFER. 



easy to understand why we get such a surface: packet 
losses occur when the buffer is full. In the scenario of 
Fig. |I]the packets first fill the lines denoted by 0, 1, 2 and 
3. The number of packets which can travel on these lines 
is given by the bandwidth delay products divided by the 
packet size CoTj/P, where the packet size in our simu- 
lations was 512 bytes. The maximum number of packets 
on the lines and in the buffer might be approximated by 
Q = B + CoTo/P + iCo(ri +T2 + T3)/P, as all pack- 
ets go through link 0, while each packet should choose 
either one of the three lines 1, 2 or 3. The sum of conges- 
tion windows W = Wi is approximately the number 
of packets in the network and packet loss occurs approxi- 
mately when W = Q. This equation defines the "surface 
of loss" inside the window phase space. This surface is 
also indicated in Fig. ^ 

In chaotic systems the attractor is often a fractal ob- 
ject. Fractals are statistically self-similar geometric ob- 
jects that might be characterized by suitably defined non- 
integer valued dimensions. For ordinary fractals this di- 
mension is less then the Eucledian embedding dimension 
D of the object [^. The fractal dimension of the points 
on the surface of section can be measured. To do this we 
can project the points onto a suitable surface. The points 
were projected onto the — const, surface and the 

fractal dimension of this two dimensional projection was 
measured. A usual method for measuring the fractal di- 
mension is when a grid of cells of size e is put on the ob- 
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ject, and the number of non-empty cells A'^(e) is counted. 
The box counting dimension is given by 



Dn 



e^O log e 



(1) 



In Fig. it can be seen that the points on the surface 
form a non-trivial fractal with box counting dimension 
Do = 1.69 ± 0.02. 



\ Do = 1.69 ±0.02 



1 

logio(f) 



Fig. 4 

Fractal dimension of the two dimensional projection of 
the surface of section of fig. |. 



Now our qualitative picture of TCP dynamics in con- 
gestion avoidance mode can be summarized as follows. 
Congestion windows steadily grow between packet losses. 
This process can usually be well approximated [|^, ^ 
with fluid equations of the type 



dwjjt) 
dt 



1 



(2) 



where T^Ti,i{^) is the round trip time of the zth TCP. 
Round trip times can also be approximated as functions 
of the congestion windows and then the resulting differ- 
ential equations (^) can be solved self-consistently. The 
sum of congestion windows also grows steadily and the 
congestion windows cut the surface of loss at some point. 
Then one of the congestion windows is halved according 
to the TCP algorithm and the position of the point in the 
phase space drops below the loss surface and the process 
starts again. 

IV. Symbolic description 

One can see that the dynamical process between packet 
losses described above is relatively simple. The compli- 
cated fractal structure of the attractor and the sensitiv- 
ity for perturbations should come from the fine details of 



the packet loss process. Each time the congestion win- 
dow trajectory crosses the loss surface a packet loss event 
happens in one of the TCP flows. One of the symbols 
Si = {1, 2, 3} can be assigned to the zth packet loss de- 
pending on which TCP lost the packet. We can consider 
the sequence of the symbols . . . . . . generated 

by the time evolution of our TCPs. This symbol sequence 
is a coding of the real congestion window evolution. Such 
symbolic coding plays an important role in the theoreti- 
cal description of chaotic systems. When an infinite se- 
quence of symbols codes exactly one or zero real space 
trajectory the coding is called Markov partition. In this 
case the symbol sequences uniquely code the chaotic dy- 
namics. An equivalent definition of the Markov partition 
is when each periodic symbol sequence codes exactly one 
or zero periodic orbit of the chaotic system. 

In the case of TCP congestion avoidance mode we 
can demonstrate that the introduced symbols form a 
Markov partition. If an infinite periodic sequence (such 
as . . . 122312231223 . . . ) is prescribed and two differ- 
ent initial congestion window triplets {{wi,W2,W2,) and 
{w'i,w'2,w'.^)^ are taken and they evolve according to the 
equations they will reach the loss surface at different 
points. One can prove that the equations are linearly 
stable, so they are not capable to increase the difference 
between two orbits. When one of the windows is halved 
after the trajectory crosses the loss surface, then the differ- 
ence between the orbits is halved in that direction while it 
remains unchanged in other directions. The time evolu- 
tion according to Eq. and the prescribed halvings will 
decrease the distance between the two orbits in each pe- 
riod. As all the TCPs should halve their windows at least 
once in each cycle the distance between the two initial 
triplets is at least halved in each period. This way we can 
see that two trajectories started from different initial con- 
ditions converge exponentially to a common periodic or- 
bit. In the end we get a unique periodic orbit correspond- 
ing to a given periodic code. So far we forced a given TCP 
to halve its window according to the prescribed symbol. 
In the end we can look at the actual packet flow generated 
by the window evolution. The prescribed periodic orbit 
can be feasible if the resulting packet flow is in accor- 
dance with the prescribed packet loss sequence or it is not 
feasible if the resulting packet flow generates losses in a 
different order than it has been assumed. This way we can 
decide if the calculated periodic orbit exists or not. This 
procedure ensures that we assign one or zero real periodic 
orbits to a periodic sequence and proves the existence of 
the Markov partition. 
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V. The tool-box of chaos 

The statistical theory of chaos [ p^ is based on the sym- 
bol sequences introduced above. If the dynamics is regu- 
lar (non-chaotic) then the dynamics is periodic or quasi 
periodic while the most important characteristics of chaos 
is that it endlessly generates topologically different new 
trajectories. This is reflected in the way different systems 
generate symbol sequences. A length n symbol sequence 
can continue in many ways; in average with a number of 
symbols to form a length n + I sequence. Thus the num- 
ber of length n + 1 sequences N{n + 1) can be expressed 
as 

N{n + 1) ^ aN{n), (3) 

when the length n is large. In chaotic systems there is 
more than one possibility to continue a sequence in aver- 
age and a > 1 while in regular systems a = 1 for long 
sequences. Consequently in chaotic systems the number 
of possible length n symbols grows exponentially 

N{n) r^a"" e^o", (4) 

and the quantity Kq = ln(a) > is the topological en- 
tropy. In non-chaotic systems the number of sequences 
grows sub-exponentially and the topological entropy is 
zero. 

To prove that our TCP system really produces chaos we 
can measure its topological entropy. In our case we can 
have a maximum of 3" different symbolic sequences of 
length n. Of course not all of them are realized by the dy- 
namics since some of them are impossible. For example 
infinite sequences consisting of only one or two symbols 
are excluded since this would imply that some of the con- 
gestion windows are never halved. In Fig. ^ we show the 
number of realized symbol sequences for different lengths 
n up to n = 12. The measurement has been carried out by 
logging out the symbols (i.e. the index of the TCP which 
lost a packet) from an n s simulation of the system. We 
generated a sequence of 150.000 consecutive symbols and 
determined how many different length n sequences exist 
in it. 

We observed that in average a = 2.586 symbols can 
follow a given symbol sequence and the topological en- 
tropy is Kq k, 0.953. This shows that our system is 
strongly chaotic as the number of sequences grows with a 
large exponent, yet it is markedly different from a stochas- 
tic system, where all combination of symbols are allowed 
and would result in a = 3 and a topological entropy of 
In 3. 

The procedure described so far measures the existence 
of different symbol sequences only. We can charac- 
terize chaos further by calculating also the probability 
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Fig. 5 

Number of different symbols as a function of symbol 
length for our system on a semi-logarithmic plot. the 

FITTED LINE IS N (n) = 1.669 X 2.586" 



P{Si, 5*2, .. . , Sn) of the occurrence of the symbol se- 
quence Si, S2, ■ ■ ■ , Sn- In a system with L symbols 
(Si = {1,2,... ,L}) we can visualize this probability 
distribution by assigning the number 

n 

x = Y.^S,- 1)L-' (5) 
1=1 

to each symbol sequence and plotting P{x). In fact < 
X < 1 is the L-ary fractional representation of the number 
represented by the symbols. 

In our system we carried out this analysis and the re- 
sult is plotted in Fig. ^. It can be clearly seen that the 

0.009 I 1 1 1 1 1 1 1 1 



p 0.004 - 








0.125 0.25 0.375 0.5 0.625 0.75 0.875 1 
Symbols 

Fig. 6 

Probability distribution of symbols of length 10 for 
our system. 

probability distribution is a fractal in the space of sym- 
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bols. In fact, in all chaotic systems we should observe a 
multifractal distribution and the topological entropy intro- 
duced above is related to the box counting dimension of 
this representation. If we cut the [0, 1[ interval into boxes 
of size e = 1/L" then the number of non-empty boxes is 
the number of existing symbols N{n). The box counting 
dimension is then 

Do= lim logA^(n)/logL" = i^o/lnL. 

n— >oo 

In our case then the box counting dimension of the mul- 
tifractal of Fig. ^ is 0.864. Note, that this box counting 
dimension is not related to the box counting dimension of 
the attractor discussed before. 

Scaling of moments of the probability distribution give 
further characterization of the multifractal properties in 
the symbol space. We can define the Renyi entropies : 



lim 



1 1 



+00 n 1 



(6) 



{Sin 



where summation {S},^ goes over all possible symbol se- 
quences of length n. The quantity Kg/lnL again mea- 
sures the Dq generalized dimension of the multifractal 
spectrum of the P{x) histogram. The most important en- 
tropy is the Kolmogorov-Sinai entropy Ki = limg^i Kg 
which gives the scaling of the Shannon entropy of the 
probability distribution: 

Ki{n) = -Y,P{Si,S2,... ,Sn)lnP{SuS2,... 

(V) 



Ki = lim -Ki{n). 

n-»oo n 



(8) 



In Fig. the Kolmogorov-Sinai entropy is measured 
for our system. 

The Kolmogorov-Sinai (KS) entropy in a chaotic sys- 
tem is also related to the Lyapunov exponent A of the map- 
ping of the Poincare section onto itself. If we consider two 
nearby trajectories on the Poincare section — which is the 
loss surface in our case — then their initial separation in 
the phase space 6wo grows each time the trajectories re- 
visit the section. After n revisits the distance grows expo- 
nentially 5wn ~ e''^"'5wQ where the average of the expo- 
nent lambda (A) is the Lyapunov exponent of the Poincare 
section. The KS entropy gives the Lyapunov exponent 
Ki = (A) in our system. The positivity of the KS en- 
tropy is another indication of chaos and the exponential 
sensitivity for the perturbation of trajectories. 

Since the structure of the Markov partition for TCPs is 
simple even for larger networks, the topological and KS 
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Ki = 0.885 ± 0.003 
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Fig. 7 

I ENTRO 

Ki{n) = ~J2ZiPi = 0-885 x n + 0.078 



Kolmogorov-Sinai entropy for our system. 

^3" 



entropies are easy to measure in a simulation or can even 
be determined in a real network. These quantities mea- 
sure the complexity of the dynamics and by evaluating 
them we can quantify complexity and evaluate TCP and 
network models in general. 

VI. Cellular chaos 

The results so far confirmed the hypothesis of chaotic 
dynamics. However, the variables of TCP in reality are 
discrete and not continuous. We can demonstrate this by 
applying such a small perturbation to congestion windows 
by which we do not change the loss events happening in 
the system. To investigate this we perturbed the trajec- 



tory of Fig. [2(a)| by a small vector Sw and traced the dif- 
ference between the original and the perturbed trajectory. 
We found that there exists a set of perturbations, shown in 
Fig. H, which vanishes after all windows are halved. The 
maximal perturbation of this type was found to be approx- 
imately |(5w| < 0.001. There is a well defined neighbor- 
hood around every phase point which defines the the same 
behavior of loss dynamics. That is, after all the conges- 
tion windows are halved, the time evolution of congestion 
windows becomes identical, mainly because the algorithm 
sets the window to its integer part. This means that the 
phase space is not continuous. It is divided into small "at- 
tractive cells" in which trajectories converge in finite time. 
If a trajectory gets into the cell of another trajectory then 
they follow the same trajectory later on. 

This property makes TCP chaos very interesting from a 
theoretical point of view, since we have a globally chaotic 
dynamics while the fine details of the system are non- 
chaotic. This is not typical in natural occurrences of chaos 



7 



basin of attraction 




Fig. 8 

Our trial TCP trajectory (green) and the points in 

PHASE SPACE (RED) WHICH FOLLOW THE SAME TRAJECTORY 

AFTER ALL THREE WINDOWS ARE HALVED. THE BASIN OF 
ATTRACTION OF THE STARTING POINT OF THE TRAJECTORY IS 
APPROXIMATELY A CUBE OF SIZE 0.001^ . 



but it is potentially important in engineered systems. The 
most significant aspect of the cellular structure is that all 
trajectories should be periodic. If a trajectory re-enters 
a cell visited previously then according to the attractive 
nature of the dynamics it will follow the same trajectory 
again and will repeat itself. Since the phase space can be 
divided into a finite number of cells a trajectory should at 
least repeat itself after visiting all the possible cells. In re- 
ality the repetition of a cell happens much earlier We can 
make an estimate of the typical length and the distribution 
of the periods of trajectories following the theory devel- 



oped by Grebogi, Oft, and Yorke [ |14| ] for chaotic systems 
with numerical roundoff. 

First let us consider the mapping connecting two con- 
secutive points on the loss surface. Also we can divide the 
loss surface into N attractive cells. Window values falling 
into the same cell will foUow the same trajectory later on. 
Now suppose that we iterate the mapping n times, and 
the orbit visited none of the cells twice. At this point we 
choose a cell random from the n visited one, and try to 
calculate the probability that in step n + l the system vis- 
its the chosen cell. The following calculations assume that 



the dynamics of the map is mixing (see 1 14] for details). 
The result of mixing is that the typical value of n — j is 
large, where j is the iterate when the chosen cell was first 
visited. Thus ^becomes large, where / = n—j + 1, and bar 
denotes expectation value. Now the probability of repeat- 
ing the chosen cell in step n + 1 is equal to the probability 
of visiting that cell. On the other hand, due to random ini- 
tial conditions, the probability that a given cell is visited 



in step j is also equal to the probability of the attractor 
to visit that cell. Combining these, for the probability of 
repeating the cell in step n + l that was first visited in step 
j one obtains 



(P) 



(9) 



where pi is the probability with which the orbit visits the 
ith cell (i.e. the measure of the attractor in that cell). So 
the probability of repeating any of the previous cells in 
step n + l, supposing that the orbit did not repeat during 
the previous n steps is: 



Vrevin) =n{p). 



(10) 



Thus the probability that an orbit of length n has no re- 
peats is 



n-l 



n-1 



PnorepH = - Pr.p(.k)) = - Hp)). (11) 



k=l 



k=l 



One can approximate lnpnorep{n) for n{p) <C 1: 



n-l 



Inpnorep(n) = ^ ln(l - k{p)) 



k=l 
n-l 



(12) 



Y^k{p)^-n'{p)/2, 



k=l 



which yields 



Pnovepin) « exp[-(n^(p)/2)] 



(13) 



The condition for this approximation {n{p) <C 1) is jus- 
tified if l{p) <C 1. As we shall see, I ~ (p)^^"^, thus the 
above condition is equivalent to T » 1. The probability 
that an orbit goes n steps without repeat and then repeats 
on step (n + 1) is 



p{n) = Prepin) Pnovepin). 



(14) 



If we assume that the probability of returning to any of the 
preceding cells is equal, then if the first repeat occurs at 
step n + l, the probability that n — j + 1 = Hs 

, . / 1/n for / < n, 
^P("'^) = |0 for/>n. 

For large / the probability density of / becomes 



p{n) Pp{n, I) dn 



^{p)'''F({p)^lH), 



(16) 



vr 



8 



where we substituted the results above, and where 

/>oo 

F{y) = / exp(-xV2) dx. 

Calculating the expectation value of I, we obtain the av- 
erage length of periodic orbits generated by the discrete, 
cellular structure of the phase space: 



lP{l)dl = y^{p)-^/\ 



(17) 



In the next section we investigate the implications of the 
periodicity of the trajectories. 

VII. Exploring periodic orbits 

Until now three parallel TCPs were studied in our sim- 
ulation scenario. In such situation the number of cells 
which can be distinguished from each other in the phase 
space is in the order of ~ 10^^. Also, the length of a typ- 
ical periodic orbit is enormous and we do not expect to 
observe them in realistic simulations. 

Therefore, it seems more reasonable to find periodic or- 
bits when only two TCPs are operating in the network. 
For this scenario we used the same network model that 
we have introduced in Fig. |l| except that the TCP with the 
largest round-trip time was removed and the buffer size 
was set to i? = 50. 

Applying the tools developed in Sections III-^ we can 
study the chaotic properties of this system. First the 
Poincare surface of section is investigated (Fig. ^). Simi- 
larly to the three dimensional case, the sum of the conges- 
tion windows has to approximately satisfy the ^ • Wi = Q 
condition at packet loss times, where Q is the maximum 
number of packets on the lines and in the buffer. This 
condition defines the "line of loss" inside the two dimen- 
sional window phase space now. The detailed structure of 
the Poincare section is shown in the inset in Fig. ^ It can 
be seen that the Poincare section is not exactly a line but 
a narrow grid spreading around the ideal line. This shape 
can be explained by the packet drop mechanism at the bot- 
tleneck buffer. A detailed discussion of this mechanism is 



given in Section |VIII . 

To measure the complexity of the TCP dynamics in this 
situation we apply the symbolic coding that was intro- 
duced in Section IV. Then, the topological (Fig. |l^) and 
the Kolmogorov-Sinai (Fig. 11) entropies are estimated. 
For topological entropy Kq = 0.54 it 0.01, while for the 
KS entropy Ki = 0.440±0.002 have been obtained. Both 
values indicate strong mixing and the presence of chaos 
in the system. However, the maximum number of states is 
limited due to the discrete phase space, and TCP must en- 
ter into a periodic cycle after an initial transient period. If 
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Fig. 9 

Poincare section of the phase space. Congestion 
window values for the two tcps at packet loss times. a 
small part of the poincare section is zoomed to 
demonstrate the fine structure of the section. 



the orbit realized in the simulation is in fact periodic with 
some large period, then the number of existing length n 
sequences N{n) increases only linearly with n if n is suf- 
ficiently large. In case of two TCP this linear growth is 
observable above n = 15 (see the inset in Fig. 10). Also 
the entropy Ki{n) saturates above n = 15 and the KS 



entropy Ki goes to zero indicating that the long time be- 
havior of the system is periodic. 



A'n = 0.54 ± 0.013 
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Fig. 10 

Number of different symbols as the function of symbol 
length n for 2 tcp on a semi-logarithmic and on a 

normal plot in the inset. the fitted lines are 
N{n) = 1.387 X 1.718" AND N{n) = 606.3 xn~ 8034. 



In our 2-TCP scenario periodic orbits were identified. 
For the given simulation setup the length of the period 
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ROPYFOI 

Ki{n) = -E^IiPi lnP» = 0.440 x n + 0.081 



KOLMOGOROV ENTROPY FOR 2 TCP SCENARIO. 
^2" 



was found to be independent of the initial conditions. The 
length of the period was I = 1744. Using this and Eqs. (0) 



and (17) we can estimate the order of magnitude of the 
number of cells in this system. The cells can be indexed 
by the corresponding symbol sequences. Suppose that we 
can index uniquely all the cells with symbol sequences 
of length n*. In this case the number of cells is approxi- 
mately given by e^°"* . The probability of repeating a cell 
is then given by 



{p)= 22 P\Si,S2,... 



-K2n* 



(18) 



where K2 is the Renyi entropy for g = 2. Combining 
(ITtI), the estimate for the number of cells e^°" and (|l^ ) 
one obtains 



N 



Kqu* 



ip) 



-K0/K2 



On the other hand (17) implies 



and finally: 



' o \ K0/K2 



(19) 



(20) 



(21) 



Assuming that the obtained period I = 1744 is a good 
approximation of the average period and using the values 
Ko = 0.54 and K2 = 0.39 we get iV « 3 • 10^. This 
is in accordance with expectations, since the area in the 
phase space visited by the congestion window trajectories 
is approximately a triangle with area 90 x 90/2 as one 



can see in Fig. ^, while the area of a cell is approximately 
0.001^ in accordance with the findings of Fig. |[ This 
implies that the visited part of the phase space contains 
approximately 4 • 10^ cells. 



VIII. Toward chaos aware modeling 

In the previous sections we managed to collect all the 
basic elements of chaos in TCP congestion avoidance. 
Now we would like to show how we can integrate all these 
into a model without stochastic elements which is able to 
reproduce all the main features. 

The time evolution of congestion windows consists of 
two parts. The first one is when windows grow steadily. 
This part is sufficiently well described by existing fluid 
models like the one given by Eq. (^. In our case just for 
qualitative comparison we can use a version, where we 
assume that the buffer is non-empty: 



dwi 1 

~~dt " n + Ti + hP/Co' 



(22) 



where h is the actual number of packets in the buffer. To 
keep things simple, we estimate the number of packets 
in the system by the sum of congestion windows and the 
packets traveling in the lines by CqTq / P + \Cq{Ti + T2 + 
T^)/P. This way the queue length in the buffer is esti- 
mated as 



The second and more crucial part of the evolution is the 
dynamics at packet loss events. In our model we can sim- 
plify matters and say that a packet loss occurs whenever 
the buffer is full and h = B holds. At that time we halve 
and take the integer part of the congestion window of the 
TCP whose packet has been lost w[ = [wi/2], where [x] 
denotes the integer part of x. 

The elements of the model discussed so far do not gen- 
erate chaotic dynamics. Now, the really important new 
element is how we make decision on which TCP loses a 
packet at a given buffer overflow. In traditional stochas- 
tic models this step is random. The argument behind this 
is that the details of the packet flow are difficult to model 
on a macroscopic scale and if the packets are well mixed 
(which is a realistic assumption) then the chance of packet 
loss for each TCP is proportional with the rate a given 
TCP sends packets into the buffer. However in reality the 
packet loss process is more deterministic and predictable 
than it might seem at first sight. A packet is shifted out 
of the buffer at each time unit r = P/C, where P is 
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the packet size and C is the bandwidth of outgoing pack- 
ets. When the buffer is full then packet loss will not oc- 
cur until only one packet comes in within the time slot r. 
Packet loss does happen when more than one packets ar- 
rive in such a critical time slot. In congestion avoidance 
mode this can happen if two or more TCP send in pack- 
ets within the critical slot or if some of the TCPs sends 
in two packets. Single packets are sent in when a TCP 
receives an acknowledgement packet, double packets are 
sent in when in addition the congestion window crosses an 
integer number. If there are more than one packets sent in 
during the critical time slot then the one which will make 
it to the buffer is selected by the exact timing of the pack- 
ets. This phenomena is called phase effect and has been 
discovered by Floyd and Jacobson in Ref. [p3|]. Depend- 
ing on the concrete setup one can determine which TCP 
will be the winner out of the ones sending packets in the 
same slot. 

In our case the phase analysis shows that the winner 
TCP is accidentally the one with the largest delay. Con- 
sequently this TCP will win all the battles for the critical 
last buffer space. The only way this TCP can lose packets 
is when it sends in two packets. This is the reason behind 
the dominance of this TCP and its very regular behavior 
observable in Fig. |2(a)| . 

It is clear now that phase effects should be integrated 
into our model. The difficulty is that the fluid conges- 
tion window model is not able to predict when packets are 
sent out by TCPs. On the other hand, if we start modeling 
the concrete packet process of TCP then we are back at 
packet level modeling. So, we have to make a good com- 
promise which saves the simplicity of the fluid equations 
while keeps the key elements of the packet loss process in- 
cluding phase effects. This can be achieved if we attribute 
a packet process to the macroscopic fluid evolution of con- 
gestion windows. This can be done as follows. Each time 
the TCP sends out a packet its congestion window is in- 
creased by l/[w]. Such a way in each round trip time the 
window increases with 1 and [w] number of packets are 
sent out. In our simulation we can reverse this and we can 
calculate the packets sent out by the TCP from the actual 
macroscopic window changes. We can calculate the fluid 



window values using the differential equations (22) at the 
beginning of each time slot: 

Wi{n) = Wi{t = n ■ t). 

We can compare Wi{n + 1) and Wi{n) and see if the win- 
dow crosses in this time slot an integer multiple of 1/ [w] . 
(This can be done by checking if [wi{n + l)[ii;j(n)]] and 
[u;i(n)[tt;j(n)]] differ or not.) If it does then we assume 
that a packet has been sent out by the TCP. If the window 



crosses also an integer then we assume that two packets 
has been sent out in that time slot. This attributed packet 
process is relevant when the buffer becomes full and the 
windows reach the loss surface. At this point we should 
check the sending status of TCPs. If more than one pack- 
ets arrive during the time slot we let the winner TCP to fill 
the buffer space while the rest of the TCPs loose packets 
and halve their windows. 

We carried out such a simulation for our model system 
with three TCP flows. The results are in accordance with 
the results obtained from the packet level ns simulation. 
In Fig. 12 we can see that a loss surface similar to Fig. || 
is formed. 



Fig. 12 

POINCARE SECTION OF THE PHASE SPACE FOR THE MODEL 
DESCRIBED IN SECTION ' 



i|VII]|. Congestion window values 

FOR THE THREE TCPS AT PACKET LOSS TIMES. COLOURS 
DEPEND ON WHICH TCP LOST A PACKET. THE DEPICTED (PINK) 
SURFACE IS WHERE THE SUM OF CONGESTION WINDOWS IS 
APPROXIMATELY EQUAL WITH THE MAXIMUM NUMBER OF 
PACKETS WHICH CAN TRAVEL ON THE LINES OR STAY IN THE 
BUFFER. 



The fractal nature of the Poincare section is also pre- 
served. The fractal dimension of the projection is shown 



in Fig. [13. 

The model creates non-trivial symbolic dynamics sim- 
ilar to the one observed in the ns simulation. The mea- 
sured topological and KS entropies are shown in Figs. 14 
and m. 

This model can be further improved and extended by a 
more detailed model of the packet loss process including 
the time slots before the buffer reaches its critical state. 
The three TCPs are able to send in a maximum of 6 pack- 
ets during a single time slot and an overflow can happen in 
a buffer with 5 free packet slots. This effect can cause the 
broadening of the loss surface as in Fig. ^ Also the time 



11 



* , Do = 1.49 ± 0.03 



4.5 
4 
3.5 
„ 3 
2.5 
^ 2 
1.5 
1 

0.5 

-2 -1.5 -1 -0.5 0.5 1 1.5 2 2.5 

login («) 

Fig. 13 

Fractal dimension of the projection of loss events in 

THE phase space OBTAINED FROM THE MODEL. 
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MODEL. K„ = - In Pi = 0.753 xn + 0.108 



KOLMOGOROV-SlNAI ENTROPY FOR 3 TCP OBTAINED FROM THE 
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Fig. 14 

Topological entropy for 3 TCP obtained from the 

MODEL. 



elapsed between the packet loss event and its notification 
can be introduced and a more accurate description of the 
relationship between the round trip time and window val- 
ues is needed to obtain quantitatively better results. 

IX. Conclusions 

In this paper we investigated the chaotic properties 
of TCPs operating in congestion avoidance mode. We 
showed that chaotic behavior is general even in the case of 
low packet loss probability. We demonstrated that the dy- 
namics can be viewed as a smooth time evolution between 
packet losses and the relevant features of chaos might be 
described by the investigation of the Poincare section de- 
fined by packet loss events. Chaotic dynamics can be 
characterized by symbol sequences and we introduced 



topological and Kolmogorov-Sinai entropies, whose val- 
ues confirmed the hypothesis of chaos. Due to the de- 
terministic nature of the system, in contrast to stochastic 
models, not all possible symbol sequences are realized, 
some of them are excluded by the dynamics. Accord- 
ingly the topological entropy is significantly less than its 
possible maximal value In L. The positive Kolmogorov- 
Sinai entropy indicates that the Lyapunov exponent of the 
system is positive and that the distribution of symbol se- 
quence probabilities is multifractal. We also proved that 
due to the cellular structure of the phase space the long 
time behavior of the system is inherently periodic. 

To reproduce these phenomena, we introduced a sim- 
ple model that incorporates a fluid model of congestion 
window evolution but also handles packet losses in a de- 
terministic way. In spite of the extensive simplifications 
we used in the model the basic dynamical characteris- 
tics of the TCP dynamics such as the fractal dimension 
of the attractor and various entropies came out in a quali- 
tatively correct form. The model is also able to count for 
the apparent unfairness of TCP flows. What we demon- 
strated here is that instead of a stochastic model of packet 
loss events, a deterministic model can be worked out and 
combined with the macroscopic fluid equations. This new 
model is able to reproduce important phase effects. While 
the attributed packet generation is not fully identical with 
the packet generation of a packet level simulation, yet 
it generates packet flows whose statistical properties are 
close to the real situation. The model correctly reproduces 
the loss probabilities of packets arriving in near coinci- 
dence into the buffer. This property seems to be essential 
in generating a proper chaotic time evolution. 
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